Toward fault-tolerant parallel-in-time integration with PFASST
نویسندگان
چکیده
منابع مشابه
Toward fault-tolerant parallel-in-time integration with PFASST
We introduce and analyze different strategies for the parallel-in-time integration method PFASST to recover from hard faults and subsequent data loss. Since PFASST stores solutions at multiple time steps on different processors, information from adjacent steps can be used to recover after a processor has failed. PFASST’s multi-level hierarchy allows to use the coarse level for correcting the re...
متن کاملFault-Tolerant Parallel Programming with Atomic Actions
The Pact (parallel actions) parallel programming environment provides an easy-to-use parallel execution and synchronization model based on task parallelization. To give the programmer an abstraction for global data (even on distributed memory machines) the Pact runtime system uses virtual shared memory. Execution’s efficiency is improved with data-dependent dynamic load balancing and latency-ma...
متن کاملToward Fault-Tolerant Adaptive Real-Time Distributed Systems
A monitoring approach to the problem of constructing fault-tolerant and adaptive real-time systems, based on the fail-signal processor, is described. Low error detection latency time is a primary goal. A fail-signal processor comprises an application processor along with a simple monitoring processor that detects abnormal functional or timing behaviour in the application processor; on such a fa...
متن کاملInterweaving PFASST and Parallel Multigrid
The parallel full approximation scheme in space and time (PFASST) introduced by Emmett and Minion in 2012 is an iterative strategy for the temporal parallelization of ODEs and discretized PDEs. As the name suggests, PFASST is similar in spirit to a space-time FAS multigrid method performed over multiple time-steps in parallel. However, since the original focus of PFASST has been on the performa...
متن کاملParallel Fault Tolerant Robot Control
Most robot controllers today employ a single processor architecture As robot control requirements become more complex these serial controllers have di culty providing the desired response time Additionally with robots being used in environments that are hazardous or inaccessible to humans fault tolerant robotic systems are particularly desirable A uniprocessor control architecture cannot o er t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Parallel Computing
سال: 2017
ISSN: 0167-8191
DOI: 10.1016/j.parco.2016.12.001